Multiple Uses of Frequent Episodes in Temporal Process Modeling

نویسنده

  • Debprakash Patnaik
چکیده

(ABSTRACT) This dissertation investigates algorithmic techniques for temporal process discovery in many domains. Many different formalisms have been proposed for modeling temporal processes such as motifs, dynamic Bayesian networks and partial orders, but the direct inference of such models from data has been computationally intensive or even intractable. In this work, we propose the mining of frequent episodes as a bridge to inferring more formal models of temporal processes. This enables us to combine the advantages of frequent episode mining, which conducts level wise search over constrained spaces, with the formal basis of process representations, such as probabilistic graphical models and partial orders. We also investigate the mining of frequent episodes in infinite data streams which further expands their applicability into many modern data mining contexts. To demonstrate the usefulness of our methods, we apply them in different problem contexts such as: sensor networks in data centers, multi-neuronal spike train analysis in neuroscience, and electronic medical records in medical informatics. Acknowledgments It is my great pleasure to thank the people who made this thesis possible. Dr. Naren Ramakrishnan, my thesis advisor, has been my greatest source of inspiration in this entire journey which now culminates in this dissertation. Every step of the way he has provided valuable insights into the data mining problems. He has helped me understand the different subtleties of data mining methods and probabilistic models. I am deeply grateful to Dr. Srivatsan Laxman who has spent innumerable sleepless nights sharing with me valuable intuitions about the different data mining problems addressed in this dissertation. He has been a great friend and has helped with words of encouragement and advice every time I hit a wall trying to solve a problem. His positive attitude and his faith in me have been instrumental in the completion of this thesis. I want to thank Dr. Manish Marwah for giving me the opportunity to work on some of the most challenging problems in knowledge discovery. With his help and guidance we were able formulate a data mining approach to an important problem in sustainability. His eye for detail and continuous persuasion to better our methods has lead to significant improvement in the quality of this work. I am indebted to Dr. T. M .Murali and Dr. Yang Cao for their unfailing support as my thesis committee members. I am thankful for their technical insights which have helped me overcome some of …

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

DYNAMICS OF MUDBANKS ALONG A COAST EXPERIENCING RECURRING EPISODES OF EROSION AND ACCRETION

The morphological states of Guyana’s coastal system, at various spatial and temporal scales, are found to be influenced by the formation and migration of mudbanks. Stationary and propagating mudbanks along the coast are investigated with the use of multiple data sources, including aerial photographs, satellite imagery, GPS measurements, and a time series of coastal profile data (1941-1987...

متن کامل

Analyzing and modeling urban sprawl and land use changes in a developing city using a CA-Markovian approach

Mashhad City, according to the latest official statistics of the country is the second populated city after Tehran and is the biggest metropolis in the east of Iran. Considering the rapid growth of the population over the last three decades, the city’s development area has been extended, significantly. This significant expansion has impacted natural lands on suburb and even some parts e.g. rang...

متن کامل

Statistical Inference of Functional Connectivity in Neuronal Networks using Frequent Episodes

Identifying the spatio-temporal network structure of brain activity from multi-neuronal data streams is one of the biggest challenges in neuroscience. Repeating patterns of precisely timed activity across a group of neurons is potentially indicative of a microcircuit in the underlying neural tissue. Frequent episode discovery, a temporal data mining framework, has recently been shown to be a co...

متن کامل

Determination of Spatial-Temporal Correlation Structure of Troposphere Ozone Data in Tehran City

Spatial-temporal modeling of air pollutants, ground-level ozone concentrations in particular, has attracted recent attention because by using spatial-temporal modeling, can analyze, interpolate or predict ozone levels at any location. In this paper we consider daily averages of troposphere ozone over Tehran city. For eliminating the trend of data, a dynamic linear model is used, then some featu...

متن کامل

Iranian TEFL Graduates’ Conceptions of Measurement Error in Research: A Genealogical Narrative Inquiry

The aim of this study is to investigate Iranian TEFL graduates’ conception of measurement error in research. Adopting a sequential explanatory multi-method strategy (Borg, 2009), the researchers analyzed causal and temporal relations in the research narratives elicited from 30 TEFL graduates. Gee’s (1986) framework for identifying narrative discourse units (lines, stanzas, and episodes) was ado...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011